智能论文笔记

Head Matters: Explainable Human-centered Trait Prediction from Head Motion Dynamics

Surbhi Madan , Monika Gahalawat , Tanaya Guha , Ramanathan Subramanian

分类：机器学习

2021-12-15

我们展示了基本的头部动作单位被称为行为分析的Kinemes，以预测人格和面试特征。将头部运动模式转换为一系列型术语有助于发现表征目标性状的潜在时间签名，从而实现有效和可说明的特征预测。利用Kinemes和面部动作编码系统（FACS）特征来预测（a）在第一次印象上的海洋人格性状候选筛选视频中，（b）在MIT数据集上的面试特征，我们注意到：（1）长期用Kineme序列训练的内存（LSTM）网络表现优于或类似于用面部图像培训的卷积神经网络（CNN）;（2）与Kinemes组合的FACS动作单位（AUS）组合实现了精确的预测和解释，并且（3）预测性能受到朝向头部和面部运动的时间长度的影响。

translated by 谷歌翻译

Learning-based Optimal Admission Control in a Single Server Queuing System

Asaf Cohen , Vijay G. Subramanian , Yili Zhang

分类： (统计)机器学习

2022-12-21

We consider a long-term average profit maximizing admission control problem in an M/M/1 queuing system with a known arrival rate but an unknown service rate. With a fixed reward collected upon service completion and a cost per unit of time enforced on customers waiting in the queue, a dispatcher decides upon arrivals whether to admit the arriving customer or not based on the full history of observations of the queue-length of the system. \cite[Econometrica]{Naor} showed that if all the parameters of the model are known, then it is optimal to use a static threshold policy - admit if the queue-length is less than a predetermined threshold and otherwise not. We propose a learning-based dispatching algorithm and characterize its regret with respect to optimal dispatch policies for the full information model of \cite{Naor}. We show that the algorithm achieves an $O(1)$ regret when all optimal thresholds with full information are non-zero, and achieves an $O(\ln^{3+\epsilon}(N))$ regret in the case that an optimal threshold with full information is $0$ (i.e., an optimal policy is to reject all arrivals), where $N$ is the number of arrivals and $\epsilon>0$.

translated by 谷歌翻译

ReCode: Robustness Evaluation of Code Generation Models

Shiqi Wang , Zheng Li , Haifeng Qian , Chenghao Yang , Zijian Wang , Mingyue Shang , Varun Kumar , Samson Tan , Baishakhi Ray , Parminder Bhatia

分类：机器学习 | 自然语言处理

2022-12-20

Code generation models have achieved impressive performance. However, they tend to be brittle as slight edits to a prompt could lead to very different generations; these robustness properties, critical for user experience when deployed in real-life applications, are not well understood. Most existing works on robustness in text or code tasks have focused on classification, while robustness in generation tasks is an uncharted area and to date there is no comprehensive benchmark for robustness in code generation. In this paper, we propose ReCode, a comprehensive robustness evaluation benchmark for code generation models. We customize over 30 transformations specifically for code on docstrings, function and variable names, code syntax, and code format. They are carefully designed to be natural in real-life coding practice, preserve the original semantic meaning, and thus provide multifaceted assessments of a model's robustness performance. With human annotators, we verified that over 90% of the perturbed prompts do not alter the semantic meaning of the original prompt. In addition, we define robustness metrics for code generation models considering the worst-case behavior under each type of perturbation, taking advantage of the fact that executing the generated code can serve as objective evaluation. We demonstrate ReCode on SOTA models using HumanEval, MBPP, as well as function completion tasks derived from them. Interesting observations include: better robustness for CodeGen over InCoder and GPT-J; models are most sensitive to syntax perturbations; more challenging robustness evaluation on MBPP over HumanEval.

translated by 谷歌翻译

CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context

Yangruibo Ding , Zijian Wang , Wasi Uddin Ahmad , Murali Krishna Ramanathan , Ramesh Nallapati , Parminder Bhatia , Dan Roth , Bing Xiang

分类：自然语言处理

2022-12-20

While pre-trained language models (LM) for code have achieved great success in code completion, they generate code conditioned only on the contents within the file, i.e., in-file context, but ignore the rich semantics in other files within the same project, i.e., cross-file context, a critical source of information that is especially useful in modern modular software development. Such overlooking constrains code language models' capacity in code completion, leading to unexpected behaviors such as generating hallucinated class member functions or function calls with unexpected arguments. In this work, we develop a cross-file context finder tool, CCFINDER, that effectively locates and retrieves the most relevant cross-file context. We propose CoCoMIC, a framework that incorporates cross-file context to learn the in-file and cross-file context jointly on top of pretrained code LMs. CoCoMIC successfully improves the existing code LM with a 19.30% relative increase in exact match and a 15.41% relative increase in identifier matching for code completion when the cross-file context is provided.

translated by 谷歌翻译

Nostradamus: Weathering Worth

Alapan Chaudhuri , Zeeshan Ahmed , Ashwin Rao , Shivansh Subramanian , Shreyas Pradhan , Abhishek Mittal

分类：机器学习

2022-12-08

Nostradamus, inspired by the French astrologer and reputed seer, is a detailed study exploring relations between environmental factors and changes in the stock market. In this paper, we analyze associative correlation and causation between environmental elements and stock prices based on the US financial market, global climate trends, and daily weather records to demonstrate significant relationships between climate and stock price fluctuation. Our analysis covers short and long-term rises and dips in company stock performances. Lastly, we take four natural disasters as a case study to observe their effect on the emotional state of people and their influence on the stock market.

translated by 谷歌翻译

Bayesian learning of Causal Structure and Mechanisms with GFlowNets and Variational Bayes

Mizu Nishikawa-Toomey , Tristan Deleu , Jithendaraa Subramanian , Yoshua Bengio , Laurent Charlin

分类：机器学习 | (统计)机器学习

2022-11-04

Bayesian causal structure learning aims to learn a posterior distribution over directed acyclic graphs (DAGs), and the mechanisms that define the relationship between parent and child variables. By taking a Bayesian approach, it is possible to reason about the uncertainty of the causal model. The notion of modelling the uncertainty over models is particularly crucial for causal structure learning since the model could be unidentifiable when given only a finite amount of observational data. In this paper, we introduce a novel method to jointly learn the structure and mechanisms of the causal model using Variational Bayes, which we call Variational Bayes-DAG-GFlowNet (VBG). We extend the method of Bayesian causal structure learning using GFlowNets to learn not only the posterior distribution over the structure, but also the parameters of a linear-Gaussian model. Our results on simulated data suggest that VBG is competitive against several baselines in modelling the posterior over DAGs and mechanisms, while offering several advantages over existing methods, including the guarantee to sample acyclic graphs, and the flexibility to generalize to non-linear causal mechanisms.

translated by 谷歌翻译

Deep Surrogate Docking: Accelerating Automated Drug Discovery with Graph Neural Networks

Ryien Hosseini , Filippo Simini , Austin Clyde , Arvind Ramanathan

分类：机器学习

2022-11-04

The process of screening molecules for desirable properties is a key step in several applications, ranging from drug discovery to material design. During the process of drug discovery specifically, protein-ligand docking, or chemical docking, is a standard in-silico scoring technique that estimates the binding affinity of molecules with a specific protein target. Recently, however, as the number of virtual molecules available to test has rapidly grown, these classical docking algorithms have created a significant computational bottleneck. We address this problem by introducing Deep Surrogate Docking (DSD), a framework that applies deep learning-based surrogate modeling to accelerate the docking process substantially. DSD can be interpreted as a formalism of several earlier surrogate prefiltering techniques, adding novel metrics and practical training practices. Specifically, we show that graph neural networks (GNNs) can serve as fast and accurate estimators of classical docking algorithms. Additionally, we introduce FiLMv2, a novel GNN architecture which we show outperforms existing state-of-the-art GNN architectures, attaining more accurate and stable performance by allowing the model to filter out irrelevant information from data more efficiently. Through extensive experimentation and analysis, we show that the DSD workflow combined with the FiLMv2 architecture provides a 9.496x speedup in molecule screening with a <3% recall error rate on an example docking task. Our open-source code is available at https://github.com/ryienh/graph-dock.

translated by 谷歌翻译

Learning robotic cutting from demonstration: Non-holonomic DMPs using the Udwadia-Kalaba method

Artūras Straižys , Michael Burke , Subramanian Ramamoorthy

分类：机器人

2022-09-24

动态运动原语（DMP）为编码，生成和调整复杂的最终效应轨迹提供了极大的多功能性。 DMP也非常适合从人类演示中学习操纵技巧。但是，DMP的反应性质限制了其用于工具使用和对象操纵任务的适用性，这些任务涉及非全面约束，例如切割手术刀切割或导管转向。在这项工作中，我们通过添加一个耦合项来扩展笛卡尔空间DMP公式，该耦合术语强制执行一组预定义的非独立约束。我们使用udwadia-kalaba方法获得约束强迫项的闭合形式表达式。这种方法提供了一种干净，实用的解决方案，以确保运行时的限制满意度。此外，约束强迫项的提议的分析形式可实现有效的轨迹优化，但受约束。我们通过展示如何从人类示范中学习机器人切割技能来证明这种方法的有用性。

translated by 谷歌翻译

Applications of Machine Learning in Chemical and Biological Oceanography

Balamurugan Sadaiappan , Preethiya Balakrishnan , Vishal CR , Neethu T Vijayan , Mahendran Subramanian , Mangesh U Gauns

分类：机器学习

2022-09-23

机器学习（ML）是指根据大量数据预测有意义的输出或对复杂系统进行分类的计算机算法。 ML应用于各个领域，包括自然科学，工程，太空探索甚至游戏开发。本文的重点是在化学和生物海洋学领域使用机器学习。在预测全球固定氮水平，部分二氧化碳压力和其他化学特性时，ML的应用是一种有前途的工具。机器学习还用于生物海洋学领域，可从各种图像（即显微镜，流车和视频记录器），光谱仪和其他信号处理技术中检测浮游形式。此外，ML使用其声学成功地对哺乳动物进行了分类，在特定的环境中检测到濒临灭绝的哺乳动物和鱼类。最重要的是，使用环境数据，ML被证明是预测缺氧条件和有害藻华事件的有效方法，这是对环境监测的重要测量。此外，机器学习被用来为各种物种构建许多对其他研究人员有用的数据库，而创建新算法将帮助海洋研究界更好地理解海洋的化学和生物学。

translated by 谷歌翻译

Testing Rare Downstream Safety Violations via Upstream Adaptive Sampling of Perception Error Models

Craig Innes , Subramanian Ramamoorthy

分类：机器人 | 机器学习

2022-09-20

在模拟中测试黑盒感知控制系统面临两个困难。首先，模拟中的感知输入缺乏现实世界传感器输入的保真度。其次，对于合理准确的感知系统，遇到罕见的故障轨迹可能需要进行许多模拟。本文结合了感知误差模型 - 基于传感器的检测系统的替代模型与状态依赖性自适应重要性抽样。这使我们能够有效地评估模拟中现实世界感知控制系统的罕见故障概率。我们使用配备RGB障碍物检测器的自动制动系统进行的实验表明，我们的方法可以使用廉价的模拟来计算准确的故障概率。此外，我们展示了安全指标的选择如何影响能够可靠地采样高概率失败的学习建议分布的过程。

translated by 谷歌翻译